
The Archive Team Just In Time Grabs

The hardest part about our transient, shallow world wide web is the terrifying swiftness with which data disappears. To this end, Archive Team members have often bravely strapped on miner's helmets and flashlights, dove into the flaming wreckage of a dying site, and grabbed a copy for all of time. Some of these rescues, consisting of what we could grab, are being saved here.

Please Note: Some of these items were not burning as brightly or as recently as others. They might be considered merely "off-site backups" of sites or collections, but in most cases the original data is now gone.

1,534 results



Archiveteam: The Patch.Com Grab (38 items, 262,375 views)
Patch reports on everything you need to know about your town, from local government to school news to what to do with your family this weekend. And your local Patch makes it easy for you and your neighbors to connect and post your news and events too. All of this, plus comprehensive listings of local restaurants and shops, home improvement services and businesses, events, and more – all in one place – in over 1,000 communities and counting.
This item contains regular captures of the homepages of Dutch news websites, in screenshot and WARC format. Websites: nos.nl, teletekst.nos.nl, rtlnieuws.nl, nu.nl, telegraaf.nl, metronieuws.nl, spitsnieuws.nl, volkskrant.nl, nrc.nl, trouw.nl, parool.nl, fd.nl, refdag.nl
While testing Version 2 of the Archive Team Warrior virtual download appliance, a selection of Tumblr blogs was downloaded over the course of a day. This 133 GB collection of mostly random sites is being stored here on the off-chance that later generations want to check them out, or that something obscure was caught before being properly archived later.
This is a download of http://forum.nos.nl/, the online discussion forums of Dutch public broadcaster NOS. It contains messages posted in 2005, 2006 and 2007 by visitors to the NOS website. Discussion topics include news, politics and NOS programmes. Downloaded June 2011. -- The archive is available in several formats: * a copy of the HTML page of each topic (including every message posted on the forum) * an XML file for each topic, providing the messages extracted from the HTML * a wget...
Topics: forum.nos.org, NOS, Forum, Archive
by archiveteam (17,263 views)
What is this about? This wiki is a catalog of the tricks of the trade for writing fiction. Tropes are devices and conventions that a writer can reasonably rely on as being present in the audience members' minds and expectations. On the whole, tropes are not clichés. The word clichéd means "stereotyped and trite." In other words, dull and uninteresting. We are not looking for dull and uninteresting entries. We are here to recognize tropes and play with them, not to make fun of them....
WOXY.COM announced the shutdown of its website months after ceasing operations as an online radio station. This Heritrix crawl contains the full WOXY.COM site as grabbed around October 28, 2011. From the WOXY.COM about page: As Dustin Hoffman said in the movie Rain Man, "97X--Bam!--The Future of Rock & Roll"...launched our Alternative (Modern Rock) format in September, 1983. Entrepreneurs Doug and Linda Balogh had bought Oxford, Ohio's WOXY...
by stillflying.net (12,355 views)
This is a panic download of stillflying.net as of 2012-09-05. This is a Firefly fan fiction site that set out to complete season 1 and season 2 of Firefly by writing episode scripts for it. An image dump warc.gz is also included.
Topics: stillflying.net, stillflying, firefly, archiveteam
Akoha, the world's first 'social reality game', announced on August 2, 2011 that it would be closing on August 15, 2011. In their own words: "Akoha is built on the basic premise that special little moments make life more awesome. We embrace this idea by offering hundreds of real-world missions, which are simple real world activities. These missions are meant to encourage users to discover new experiences, capture them, and share them with friends." This is a Heritrix crawl of the...
Archive Team panic download of Gamepro Forums, December 2, 2011. Collected in WARC format.
This is a Heritrix crawl of http://llink.nl/, the website of Dutch public broadcasting association LLiNK, made on 23 and 24 June 2011. It includes the main website as well as the programme-specific websites of LLiNK radio and television programmes. The crawl logs and order file are available in llink-20110624-crawl-logs.tar.bz2 -- The MD5 checksums of the files are: 050b714c6df98a29bdb6c1ff077c6953 llink-20110623100606-00000.warc.gz 49de8110b71ba8da7607eafbfe80fd50...
Topics: LLiNK, Dutch, Archive, Webgrab, NPO
This item contains regular captures of the homepages of Dutch news websites, in screenshot and WARC format.
The Montreal Mirror was a free weekly English-language newspaper which was published between 1985 and 2012. On June 21, 2012, its publisher announced that it would cease publication immediately. This archive contains archived copies of the newspaper's website between 1997 and 2010, as published at the URL http://www.montrealmirror.com/potatoes/archive.html, and was archived on June 30, 2012.
Topic: archiveteam web newspaper montreal quebec canada
A content grab of the City of Heroes website (separate from the forums) made on October 5, 2012.
by underground-gamer.com (6,115 views)
These are the external images linked in the underground-gamer forums as of 2012-09. I downloaded them over 3 days, from 2012-09-22 to 2012-09-24. They are in 3 warc.gz files. This should help give a more complete picture of the underground-gamer forums.
Topics: underground-gamer.com, underground-gamer, archiveteam
by andriasang.com (6,008 views)
This is a panic download of andriasang.com as of 2012-12-24. The site covers Japanese gaming; this grab is of its English-language section.
Topics: andriasang.com, archiveteam
by isohunt.com (5,624 views)
This is a panic download of isohunt.com/forum/ external images as of 2013-10-21.
Topics: isohunt.com, archiveteam
This is an image dump of all image links in my thebox.bz articles dump. It was made on 2013-01-15.
Topics: thebox.bz, archiveteam
by andriasang.com (5,549 views)
This is a panic download of andriasang.com images as of 2012-12-26. I grabbed the image URLs from my articles dump. There are over 317k images in these WARCs.
Topics: andriasang.com, archiveteam
by underground-gamer.com (5,159 views)
These are all the images from underground-gamer.com as of 2012-09-21. There is a warc.gz and a tar.gz. I have also included a text file listing the URLs that I grabbed.
Topics: underground-gamer.com, underground-gamer, archiveteam
by arstechnica.com (4,610 views)
This is a mirror of all article URLs from arstechnica.com that have 2009 in their URL path.
Topics: arstechnica.com, arstechnica articles, archiveteam
WARC BBC Radio 4 iPM podcast website snapshots, 9 October 2013 to 8 November 2013. Archive by ET3RN4L
Topics: BBC, Radio 4, iPM, archive, snapshot, WARC, ET3RN4L
by isohunt.com (4,289 views)
This is a panic download of isohunt.com/forum/ index pages as of 2013-10-17.
Topics: isohunt.com, archiveteam
by www.theguardian.com (4,284 views)
This is a panic download of 2001-10 world articles from theguardian.com as of 2013-08-25.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
This is an archive of The Department Store Museum, an online museum of America and Canada's independent department stores. The content was downloaded on 2011-10-17 from http://departmentstoremuseum.blogspot.com/. From the site's introduction: Welcome to the Museum. The Department Store Museum is an on-line homage to America's great, late-lamented department stores. There is an extraordinary amount of information about many of these stores - logos, floor directories, ads, etc., and it is hoped...
A last-minute grab of ds.gamespy.com before it shuts down.
Topics: webcrawl, archiveteam
by www.theguardian.com (3,552 views)
This is a panic download of 2001-11 world articles from theguardian.com as of 2013-08-25.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
by www.theguardian.com (3,541 views)
This is a panic download of 2010-06 world articles from theguardian.com as of 2013-09-15.
Topics: www.theguardian.com, theguardian, world articles, archiveteam
by kat.ph/blog/ (3,410 views)
This is a panic download of kat.ph/blog/, the blog section of kat.ph, a BitTorrent/pirate website. Only the blog was backed up. I also did an image dump of all images on the blog, likewise in warc.gz format.
Topics: kat.ph/blog/, kickasstorrents, archiveteam